Picture for Shashi Kumar

Shashi Kumar

Geometric Latent Reasoning Induces Shorter Generations in LLMs

Add code
Jun 01, 2026
Viaarxiv icon

Evaluation of Automatic Speech Recognition Using Generative Large Language Models

Add code
Apr 23, 2026
Viaarxiv icon

Closing the Speech-Text Gap with Limited Audio for Effective Domain Adaptation in LLM-Based ASR

Add code
Apr 07, 2026
Viaarxiv icon

Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR

Add code
Mar 27, 2026
Viaarxiv icon

Nonparametric Variational Differential Privacy via Embedding Parameter Clipping

Add code
Mar 10, 2026
Viaarxiv icon

Reducing Prompt Sensitivity in LLM-based Speech Recognition Through Learnable Projection

Add code
Jan 28, 2026
Viaarxiv icon

Text-only adaptation in LLM-based ASR through text denoising

Add code
Jan 28, 2026
Viaarxiv icon

TokenVerse++: Towards Flexible Multitask Learning with Dynamic Task Activation

Add code
Aug 27, 2025
Viaarxiv icon

Unifying Streaming and Non-streaming Zipformer-based ASR

Add code
Jun 17, 2025
Figure 1 for Unifying Streaming and Non-streaming Zipformer-based ASR
Figure 2 for Unifying Streaming and Non-streaming Zipformer-based ASR
Figure 3 for Unifying Streaming and Non-streaming Zipformer-based ASR
Figure 4 for Unifying Streaming and Non-streaming Zipformer-based ASR
Viaarxiv icon

Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering

Add code
Jun 05, 2025
Figure 1 for Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering
Figure 2 for Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering
Figure 3 for Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering
Viaarxiv icon